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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

APPLICANT: Nelson, Edward L. 

Nelson, Peter J. 

TITLE OF INVENTION: NOVEL VECTOR FOR 
POLYNUCLEOTIDE VACCINES 

NUMBER OF SEQUENCES: 2 9 

CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MORGAN & FINNEGAN, L.L.P. 

(B) STREET: 34 5 PARK AVENUE 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: USA 

(F) ZIP: 10154 

COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.5 INCH, 1.44 MB 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WORDPERFECT 5.1 



CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US97/143 0 6 

(B) FILING DATE: 14-AUG-1997 

PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US60/023931 

(B) FILING DATE: 14-AUG-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: KATHRYN M. BROWN 

(B) REGISTRATION NUMBER: 34556 

( C ) REFERENCE/DOCKET NUMBER : 2026-423 6US 1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 758-4800 

(B) TELEFAX: (212) 751-6849 

(C) TELEX: 421792 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



ru 



(i) 

(ii) 

(iii) 
(iv) 



(v) 



(vi) 



(vii) 
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76 



( i i i ) H Y POTHET I CAL : No 
( iv) ANTI - SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 1 : 



40 
80 



GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT 
GACGAGCATC AC AAAAAT CG ACGCTCAAGT CAGAGGTGGC 

GAAAC CCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC 12 0 

TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG 16 0 

CTTACCGGAT ACCTCTCCGC CTTTCTCCCT TCGGGAAGCG 200 

TGGCGCTTTC TCAATGCTCA CGCTGTAGGT ATCTCAGTTC 24 0 

GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA 280 

CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT 3j0 

B ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC 36 0 

ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG 400 

nj TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA 

jf ACTACGGCTA CAC 



400 
440 
453 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TO POLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : No 

(iv) ANTI -SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 : 

GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG 40 

CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT 8 0 

GGCTGCTGCC AGTGGCGATA AGTCGTGTCT TACCGGGTTG 12 0 

GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG 16 0 

GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG 200 

AACGACCTAC ACCGAACTGA GATACCTACA CCGTGAGCAT 240 

TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA 2 80 

GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC 320 

GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT 360 

CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT 4 00 
TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC 
CAGCAACGCG GCC 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 base pairs 



440 
453 
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(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

( iv) ANTI - SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 3 : 

GAATTCTTTC GGACTTTTGA AAGTGATGGT GGTGGCCGAA 4 0 

GGATTCGAAC CTTCGAAGTC GATGACGGCA GATTTAGAGT 80 

CTGCTCCCTT TGGCCGCTCG GGAACCCCAC CACGGGTAAT 12 0 

GCTTTTACTG GCCTGCTCCC TTATCGGGAA GCGGGGCGCA 16 0 

C T CAT AT C AAA TGACGCGCCG CTGTAAAGTG TTACGTTGAG 200 

^0 AAAGAATTC 

pj (2) INFORMATION FOR SEQ ID NO: 4: 

Li (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 09 base pairs 

s (B) TYPE: nucleic acid 

M= (C) STRANDEDNESS: single 

y± (D) TOPOLOGY: unknown 

O 

[I (ii) MOLECULE TYPE: cDNA 

'tf (iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4 : 

GAATTCTTTC TCAACGTAAC ACTTTACAGC GGCGCGTCAT 4 0 

TTGATATGAT GCGCCCCGCT TCCCGATAAG GGAGCAGGCC 80 

AGTAAAAGCA TTACCCGTGG TGGGGTTCCC GAGCGGCCAA 12 0 

AGGGAGCAGA CTCTAAATCT GCCGTCATCG ACTTCGAAGG 16 0 

TTCGAATCCT TCCCCCACCA CCATCACTTT CAAAAGTCCG 2 00 

AAAGAATTC 209 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 5 
AATAAA 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 6 



ATTAAA 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 



AGTAAA 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 8 : 



AAGAAC 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) H Y POTHET I CAL : No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 9 : 



AATACA 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10 



GCCTTAAGGG CCATATGGTG AGTGGATCCC TTGACCCCAG 
GCGGGGATGG GGAGACCTGT AGTCAGAGCC CCCGGGCAGC 
ACAGGCCAAT GCCCGTCCTT CCCCTGCAGG ATGAGTAGTG 
AGTGCCTCTC CTGGCCCTGG AAGTTGCCAC TCCAGTGCCC 
ACCAGCCTTG TCCTAATAAA ATTAAGTTGC ATCATTTTGT 
CTGACTAGGT GTCCTCTATA ATATTAT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 7 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11 



ATAATATTAT AGAGGACACC TAGTCAGAAC AAATGATGCA 
ACTTAATTTT ATTAGGACAA GGCTGGTGGG CACTGGAGTG 
GCAACTTCCA GGGCCAGGAG AGGCACTCAC TACTCATCCT 
GCAGGGGAAG GACGGGCATT GGC CTGTGCT GCCCGGGGGC 
TCTGACTACA GGTCTCCCCC ATCCCCGCCT GGGGTCAAGG 
CATCCACTCA CCATATGGCC CTTAAGG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12 



CCTCGGTACC TGCCATGGCG CGGATTCTTT ATCACTGATA 

AGTTGGTGGA CATATTATGT TTATCAGTGA T AAAGTGT C A 

AGCATGACAA AGTTGCAGCC GAATACAGTG ATCCGTGCCG 

GCCCTGGACT GTTGAACGAG GTCGGCGTAG ACGGTCTGAC 

GACACGCAAA CTGGCGGAAC GGTTGGGGGT GCAGCAGCCG 

GCGCTTTACT GGCACTTCAG GAACAAGCGG GCGCCTTAAG 
GGCCATATGC CG 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: cDNA 

( i i i ) H Y POTHET I CAL : No 
(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 



CCTCGGTACC TGCCACCATG GCGCGGATTC TTTAT 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TO POLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
( i i i ) HYPOTHET I CAL : No 
(iv) ANTI -SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14 



CGGCATATGG CCTTAAGGCG CCCGCTTGTT CCTGAAGT 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
( i i i ) HYPOTHETI CAL : No 
(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15 



GCCTTAAGGG C C AT ATGGTG AGTGGATGCC TTGACCCCAG 
GCGGGGATGG GGGAGACCTG TAGTCAGAGC CCCCGGGCAG 

CACAGGCCAA TGCCCGTCCT TCCCCTGCAG GATGAGTAGT 
GAGTGCCTCT CCTGGCCCTG GAAGTTGCCA CTCCAGTGCC 
CACCAGCCTT GTCCTAATAA AATTAAGTTG CATCATTTTG 
TCTGACTAGG TGTCCTCTAT AATATTAT 
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(2) INFORMATION FOR SEQ ID NO: 16: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(ii) 



MOLECULE TYPE: 



cDNA 



(iii) 



HYPOTHETICAL: 



No 



(iv) 



ANTI -SENSE : 



NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 16: 



TGCCATGGCG CGGATTCTTT AT C ACTGAT A AGTTGGTGGA 
CATATTATGT TTATCAGTGA TAAAGTGTCA AGCATGACAA 
AGTTGCAGCC GAATACAGTG ATCCGTGCCG GCCCTGGACT 
GTTGAACGAG GTCGGCGTAG ACGGTCTGAC GACACGCAAA 
CTGGCGGAAC GGTTGGGGGT GCAGCAGCCG GCGCTTTACT 
GGCACTTCAG GAACAAGCGG GCGCCTTAAG GGCCATATGG 
TGAGTGGATG CCTTGACCCC AGGCGGGGAT GGGGGAGACC 
TGTAGTCAGA GCCCCCGGGC AG C AC AGGC C AATGCCCGTC 
CTTCCCCTGC AGTGAGTAGT GACTGCCCGG GTGGGATCCC 
TGTGACCCCT CCCCAGTGCC TCTCCTGGCC CTGGAAGTTG 
CCACTCCAGT GCCCACCAGC CTTGTCCTAA TAAAATTAAG 
TTGCATCATT TTGTCTGACT AGGTGTCCTC TATAATATTA 
TAAGCTTGAT ATCGAATTCT TTCTCAACGT AACACTTTAC 
AGCGGCGCGT CATTTGATAT GATGCGCCCC GCTTCCCGAT 
AAGGGAGCAG GCCAGTAAAA GCATTACCCG TGGTGGGGTT 
CCCGAGCGGC CAAAGGGAGC AGACT CT AAA TCTGCCGTCA 
TCGACTTCGA AGGTTCGAAT CCTTCCCCCA CCACCATCAC 
TTTCAAAAGT CCGAAAGAAT TCCTGCAGCC CGTGTAGCCG 
TAGTTAGGCC ACCACTTCAA GAACTCTGTA GCACCGCCTA 
CATACCTCGC TCTGCTAATC CTGTT AC C AG TGGCTGCTGC 
CAGTGGCGAT AAGTCGTGTC TTACCGGGTT GGACTCAAGA 
CGATAGTTAC CGGATAAGGC GCAGCGGTCG GGCTGAACGG 
GGGGTTCGTG CACACAGCCC AGCTTGGAGC G AACGAC CT A 
CACCGAACTG AGATACCTAC AGCGTGAGCA TTGAGAAAGC 
GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG 
TAAGCGGCAG GGTCGGAACA GGAGAGCGCA CGAGGGAGCT 
TCCAGGGGGA AACGCCTGGT ATCTTTATAG TCCTGTCGGG 
TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT 
CGTCAGGGGG GCGGAGCCTA TGGAAAAACG CCAGCAACGC 
GGCCGGGGGA TCCGGAGAGC T C ACT CT AGA TGAGAG AG C A 
GTGAGGGAGA GACAGAGACT CGAATTTCCG GAGCTATTTC 
AGTTTTCTTT TCCGTTTTGT GCAATTTCAC TTATGATACC 

GGCCAATGCT TGGTTGCTAT TTTGGAAACT CCCCTTAGGG 
GATGCCCCTC AACTGGCCCT ATAAAGGGCC AGCCTGAGCT 
GCAGAGGATT CCTGCAGAGG ATCAAGACAG CACGTGGACC 
TCGCACAGCC TCTCCCACAG GTACC 



40 
80 
120 
160 
200 
240 
280 
320 
360 
400 
440 
480 
520 
560 
600 
640 
680 
720 
760 
800 
840 
880 
920 
960 
1000 
1040 
1080 
1120 
1160 
1200 
1240 
1280 

1320 
1360 
1400 
1425 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 



ATGAGCAAGG GCGAGGAACT GTTCACTGGC 
TTCTCGTGGA ACTGGATGGC GATGTGAATG 
TTCTGTCAGC GGAGAGGGTG AAGGTGATGC 
AAGCTCACCC TGAAATTCAT CTGCACCACT 
CTGTGCCATG GCCAACACTG GTCACTACCT 
CGTGCAGTGC TTTTCCAGAT ACCCAGACCA 
CATGACTTTT TCAAGAGCGC CATGCCCGAG 
AGGAGAGAAC CATCTTTTTC AAAGATGACG 
GACCCGCGCT GAAGTCAAGT TCGAAGGTGA 
AATAGAATCG AGTTGAAGGG CATTGACTTT 
GAAACATTCT CGGCCACAAG CTGGAATACA 
C C AC AATGTG T AC AT C ATGG CCGACAAGCA 
AT C AAGGT C A ACTTCAAGAT CAGACACAAC 
GATCCGTGCA GCTGGCCGAC CATTATCAAC 
AATCGGCGAC CGCCCTGTGC TCCTCCCAGA 
CTGTCCACCC AGTCTGCCCT GTCTAAAGAT 
AGAGAGACCA CATGGTCCTG CTGGAGTTTG 
TGGG AT C AC A CATGGCATGG ACGAGCTGTA 



GTGGTCCCAA 4 0 

GGCACAAATT 8 0 

CACATACGGA 12 0 

GGAAAGCTCC 16 0 

TCACCTATGG 200 

TATGAACGAG 24 0 

GGCTATGTGC 2 80 

GGAACTACAA 320 

CACCCTGGTG 3 6 0 

AAGGAAGATG 40 0 

ACTATAACTC 44 0 

AAAGAATGGC 48 0 

ATTGAGGATG 52 0 

AGAACACTCC 56 0 

CAACAATTAC 6 00 

CCCAACGAAA 64 0 

TGACCGCTGC 68 0 

CAAGTGAGC 719 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18 
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TATGAGCAAG GGCGAGGAAC TGTTCACTGG CGTGGTCCCA 
ATTCTCGTGG AACTGGATGG CGATGTGAAT GGGCACAAAT 
TTTCTGTCAG CGGAGAGGGT GAAGGTGATG CCACATACGG 
AAAGCTCACC CTGAAATTCA TCTGCACCAC TGGAAAGCTC 
CCTGTGCCAT GGCCAACACT GGTCACTACC TTCACCTATG 
GCGTGCAGTG CTTTTCCAGA TACCCAGACC ATATGAAGCA 
GCATGACTTT TTCAAGAGCG CCATGCCCGA GGGCTATGTG 
CAGGAGAGAA CCATCTTTTT CAAAGATGAC GGGAACTACA 
AGACCCGCGC TGAAGTCAAG TTCGAAGGTG AC AC C CTGGT 
GAATAGAATC GAGTTGAAGG GCATTGACTT TAAGGAAGAT 
GGAAAC ATT C TCGGCCACAA GCTGGAATAC AACTATAACT 
CCCACAATGT GT AC AT CATG GCCGACAAGC AAAAGAATGG 
CAT C AAGGTC AACTTCAAGA T C AG AC AC AA CATTGAGGAT 
GGATCCGTGC AGCTGGCCGA C C ATT AT C AA CAGAACACTC 
CAATCGGCGA CGGCCCTGTG CTCCTCCCAG ACAACCATTA 
CCTGTCCACC CAGTCTGCCC GTCTAAAGAT CCCAACGAAA 
AGAGAGACCA CATGGTCCTG CTGGAGTTTG TGACCGCTGC 
TGGG AT C AC A CATGGCATGG ACGAGCTGTA CAAGTGAGCC 
ATATGGTGAG TGGATGCCTT GACCCCAGGC GGGGATGGGG 



CCCGTCCTTC CCCTGCAGTG AGTAGTGACT GCCCGGGTGG 
GATCCCTGTG ACCCCTCCCC AGTGCCTCTC CTGGCCCTGG 
AAGTTGCCAC TCCAGTGCCC ACCAGCCTTG TCCTAATAAA 
ATTAAGTTGC ATCATTTTGT CTGACTAGGT GTCCTCTATA 
ATATTATAAG CTTGATATCG AATTCTTTCT CAACGTAACA 
CTTTACAGCG GCGCGTCATT TGATATGATG CGCCCCGCTT 
CCCGATAAGG GAGCAGGCCA GTAAAAGCAT TACCCGTGGT 
GGGGTTCCCG AGCGGCCAAA GGGAGCAGAC TCTAAATCTG 
CCGTCATCGA CTTCGAAGGT TCGAATCCTT CCCCCACCAC 
CATCACTTTC AAAAGTCCGA AAGAATTCCT GCAGCCCGTG 
TAGCCGTAGT TAGGCCACCA CTTCAAGAAC TCTGTAGCAC 
CGC CTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 
TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC 
TCAAGACGAT AGTTACCGGA TAAGGCGCAG CGGTCGGGCT 
GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC 
GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCATTGA 
GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT 
ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG 
GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT 
GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT 
GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG 
CAACGCGGCC GGGGGATCCG GAGAGCTCAC TCTAGATGAG 
AGAGCAGTGA GGGAGAGACA GAGACT CGAA TTTCCGGAGC 
TATTTCAGTT TTCTTTTCCG TTTTGTGCAA TTTCACTTAT 
GAT AC CGGC C AATGCTTGGT TGCTATTTTG GAAACTCCCC 
TTAGGGGATG CCCCTCAACT GGCCCTATAA AGGGCCAGCC 
TGAGCTGCAG AGGATTCCTG C AG AGG AT C A AGACAGCACG 
TGGACCTCGC ACAGCCTCTC C C AC AGGT AC C 



(2) INFORMATION FOR SEQ ID NO: 19: 



40 
80 
120 
160 
200 
240 
280 
320 
360 
400 
440 
480 
520 
560 
600 
640 
680 
720 
760 
800 
840 
880 
920 
960 
1000 
1040 
1080 
1120 
1160 
1200 
1240 
1280 
1320 
13 6 0 
1400 
1440 
1480 
1520 
1560 
1600 
1640 
1680 
1720 
1760 
1800 
1840 
1880 
1911 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION : SEQ ID NO: 19: 



Pro Asp Leu Ser Tyr Met Pro He Trp Lys Phe Pro 

1 5 10 

Asp Glu Glu Gly Ala Cys Gin Pro Cys Pro He Asn 

Cvs Thr His Ser Cys Val Asp Leu Asp Asp Lys Gly 

25 30 35 

Cys Pro Ala Glu Gin Arg Ala Ser Pro Leu Thr Ser 

40 45 
He He Ser Ala Val Val Gly He Leu Leu Val Val 
50 55 60 

Val Leu Gly Val Val Phe Gly He Leu 

65 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 



Pro Ala Pro Gly 
1 

His Arg Ser Ser 
15 

Leu Thr Leu Gly 
25 

Pro Arg Ser Pro 
40 

Ser Asp Val Phe 
50 

Ala Lys Gly Leu 

Pro Leu Gin Arg 
75 

Leu Pro Ser Glu 
85 

Thr Cys Ser Pro 
100 

Asp Val Arg Pro 
110 

Leu Pro Ala Ala 

Arg Pro Lys Thr 
135 



Ala Gly Gly Met 
5 

Ser Thr Arg Ser 
20 

Leu Glu Pro Ser 
30 

Leu Ala Pro Ser 

Asp Gly Asp Leu 
55 

Ser Leu Pro Thr 
65 

Tyr Ser Glu Asp 
80 

Thr Asp Gly Tyr 
90 

Gin Pro Glu Tyr 

Pro Pro Ser Pro 
115 

Arg Pro Ala Gly 
125 

Leu Ser Pro Gly 
140 



Val His His Arg 
10 

Gly Gly Gly Asp 

Glu Glu Glu Ala 
35 

Glu Gly Ala Gly 
45 

Gly Met Gly Ala 
60 

His Asp Pro Ser 
70 

Pro Thr Val Pro 

Val Ala Pro Leu 
95 

Val Asn Gin Pro 
105 

Arg Glu Gly Pro 
120 

Ala Thr Leu Glu 
130 

Lys Asn Gly Val 
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Val Lys Asp Val 
145 

Asn Pro Glu Tyr 
160 

Pro Gin Pro Glu 
170 

Pro Gin Pro Pro 

Ala Ala Arg Pro 
195 

Lys Leu Ser Pro 
205 

Val Phe Ala Phe 
220 

Tyr Leu Thr Pro 
230 

His Pro Pro Pro 

Leu Tyr Tyr Trp 
255 

Pro Pro Ser Thr 
260 

Asn Pro Glu Tyr 
280 



Phe Ala Phe Gly 
150 

Leu Thr Pro Gin 

Tyr Val Asn Gin 
175 

Ser Pro Arg Glu 
185 

Ala Gly Ala Thr 
200 

Gly Lys Asn Gly 
210 

Gly Gly Ala Val 

Gin Gly Gly Ala 
235 

Ala Phe Ser Pro 
245 

Asp Asp Pro Pro 
260 

Phe Lys Gly Thr 
270 

Leu Gly Leu Asp 



Gly Ala Val Glu 
155 

Gly Thr Cys Ser 
165 

Pro Asp Val Arg 
180 

Gly Pro Leu Pro 
190 

Leu Glu Arg Pro 

Val Val Lys Asp 
215 

Glu Asn Pro Glu 
225 

Ala Pro Gin Pro 
240 

Ala Phe Asp Asn 
250 

Glu Arg Gly Ala 

Pro Thr Ala Glu 
275 

Val Pro Val 
285 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:21: 



He He Ser Ala Val Val Gly He Leu Leu Val Val 
1 5 10 

Val Leu Gly Val Val Phe Gly He Leu He 
15 20 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2125 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:22: 

GCCACCATGG CCCCTGACCT CTCCTACATG CCCATCTGGA 40 

AGTTTCCAGA TGAGGAGGGC GCATGCCAGC CTTGCCCCAT 80 

CAACTGCACC CACTCCTGTG TGGACCTGGA TGACAAGGGC 120 

TGCCCCGCCG AGCAGAGAGC CAGCCCTCTG ACGTCCATCA 16 0 

TCTCTGCGGT GGTTGGCATT CTGCTGGTCG TGGTCTTGGG 200 

GGTGGTCTTT GGGATCCTCA TCAAGCGACG GCAGCAGAAG 240 

AT CACATGTC CAGACCCTGC CCCGGGCGCT GGGGGCATGG 280 

TCCACCACAG GCACCGCAGC TCATCTACCA GGAGTGGCGG 320 

TGGGGACCTG ACACTAGGGC TGGAGCCCTC TGAAGAGGAG 360 

GCCCCCAGGT CTCCACTGGC ACCCTCCGAA GGGGCTGGCT 400 

C CGATGT ATT TGATGGTGAC CTGGGAATGG GGGCAGCCAA 440 

GGGGCTGCAA AGCCTCCCCA CACATGACCC CAGCCCTCTA 480 

CAGCGGTACA GTGAGGACCC CACAGTACCC CTGCCCTCTG 52 0 

AGACTGATGG CTACGTTGCC CCCCTGACCT GCAGCCCCCA 560 

GC CTGAATAT GTGAACCAGC CAGATGTTCG GCCCCAGCCC &0U 

CCTTCGCCCC GAGAGGGCCC TCTGCCTGCT GCCCGACCTG 640 

CTGGTGCCAC TCTGGAAAGG CCCAAGACTC TCTCCCCAGG 6 80 

GAAGAATGGG GTCGTCAAAG ACGTTTTTGC CTTTGGGGGT 72 0 

GCCGTGGAGA ACCCCGAGAC TTGACACCCC AGGGAGGAGC 760 

TGCCCCTCAG CCCCACCCTC CTCCTGCCTT CAGCCCAGCC 800 

TTCGACAACC TCTATTACTG GGACCAGGAC CCACCAGAGC 840 

GGGGGGCTCC ACCCAGCACC TTCAAAGGGA CACCTACGGC 880 

AGAGAACCCA GAGTACCTGG GTCTGGACGT GCCAGTGTGA 920 

AGCCTTAAGG GCCATATGGT GAGTGGATGC CTTGACCCCA 960 

GGCGGGGATG GGGGAGACCT GTAGTCAGAG CCCCCGGGCA 1000 

GCACAGGCCA ATGCCCGTCC TTCCCCTGCA GTGAGTAGTG 1040 

ACTGCCCGGG TGGGATCCCT GTGACCCCTC CCCAGTGCCT 1080 

CTCCTGGCCC TGGAAGTTGC CACTCCAGTG CCCACCAGCC 1120 

TTGTCCTAAT AAAATTAAGT TGCATCATTT TGTCTGACTA 1160 

GGTGTCCTCT ATAATATTAT AAGCTTGATA TCGAATTCTT 1200 

TCTCAACGTA ACACTTTACA GCGGCGCGTC ATTTGATATG 1240 

ATGCGCCCCG CTTCCCGATA AGGGAGCAGG CCAGTAAAAG 1280 

CATTACCCGT GGTGGGGTTC CCGAGCGGCC AAAGGGAGCA 1320 

GACTCTAAAT CTGCCGTCAT CGACTTCGAA GGTTCGAATC 1360 

CTTCCCCCAC CACCATCACT TTCAAAAGTC CGAAAGAATT 1400 

CCTGCAGCCC GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 1440 

AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC 1480 

TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT 1520 

TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 1560 

CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA 1600 

GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA 1640 

GCGTGAGCAT TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 1680 

AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG 172 0 

GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGC CTGGT A 1760 

T CTTT ATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 1800 

CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT 1840 

GGAAAAACGC CAGCAACGCG GCCGGGGGAT CCGGAGAGCT 1880 

CACTCTAGAT GAGAGAGCAG TGAGGGAGAG ACAGAGACTC 192 0 

GAATTTCCGG AGCTATTTCA GTTTTCTTTT CCGTTTTGTG I960 
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CAATTTCACT TATGATACCG GCCAATGCTT GGTTGCTATT 2 0 00 

TTGGAAACTC CCCTTAGGGG ATGCCCCTCA ACTGGCCCTA 2040 

T AAAGGGC C A GCCTGAGCTG C AG AGG ATT C CTGCAGAGGA 2 08 0 

TCAAGACAGC ACGTGGACCT CGCACAGCCT CTCCCACAGG 212 0 
TACCT 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: oligonucleotide 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 23: 
GTCTGCCACC ATGGCCTACT CCCCTGC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 
TTCTTTGGTG ACCTACCTCT TCGGAATTGC CGAGTC 36 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 25: 

ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC 4 0 

TGAGTCAGGA AACATTTTCA GACCTATGGA AACTACTTCC 80 

TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 120 

GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT 16 0 

TCACTGAAGA CCCAGGTCCA GATGAAGCTC CCAGAATGCC 200 

AGAGGCTGCT CCCCGCGTGG CCCCTGCACC AGCAGCTCCT 240 

ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT 280 

CATCTTCTGT CCCTTCCCAG AAAACCTACC AGGGCAGCTA -*2U 

f*. CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 360 

TCTGCCACCA TGGCCTACTC CCCTGCGTCT GTGACTTGCA 400 

21 CGTACTCCCC TGCCCTCAAC AAGATGTTTT GCCAACTGGC 440 

! * CAAGACCTGC CCTGTGCAGC TGTGGGTTGA TTCCACACCC 480 

f CCGCCCGGCA CCCGCGTCCG CGCCATGGCC ATCTACAAGC 520 

HJ AGTCACAGCA CATGACGGAG GTTGTGAGGC GCTGCCCCCA 56 0 

rU CCATGAGCGC TGCTCAGATA GCGATGGTCT GGCCCCTCCT 600 

p CAGCGTCTTA TCCGAGTGGA AGGAAATTTG CGTGTGGAGT 640 

Hi ATTTGGATGA CAGAAACACT TTTCGACATA GTGTGGTGGT 680 

GCCCTATGAG CCGCCTGAGG TTGGCTCTGA CTGTACCACC 720 

L= ATCCACTACA ACTACATGTG TAACAGTTCC TGCATGGGCG 76 0 

GCATGAACCG GAGGCCCATC CTCACCATCA TCACACTGGA 800 

^ AGACTCCAGT GGTAATCTAC TGGGACGGAA CAGCTTTGAG 840 

Q GTGCGTGTTT GTGCCTGTCC TGGGAGAGAC CGGCGCACAG 880 

M AGGAAGAGAA TCTCCGCAAG AAAGGGGAGC CTCACCACGA 920 

u3 GCTGCCCCCA GGGAGCACTA AGCGAGCACT GCCCAACAAC 960 

,« ACCAGCTCCT CTCCCCAGCC AAAGAAGAAA CCACTGGATG 1000 

GAGAATATTT CACCCTTCAG ATCCGTGGGC GTGAGCGCTT 1040 

CGAGATGTTC TTTGGTGACC TACCTCTTCG GAATTGCCGA 1080 

GTCTTCCGAG AGCTGAATGA GGCCTTGGAA CTCAAGGATG 1120 

CCCAGGCTGG GAAGGAGCCA GGGGGGAGCA GGGCTCACTC 116 0 

CAGCCACCTG AAGTCCAAAA AGGGTCAGTC TACCTCCCGC 1200 

CATAAAAAAC TCATGTTCAA GACAGAAGGG CCTGACTCAG 1240 

AC 1242 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 
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(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 26 



40 
80 
120 
160 
200 
240 
280 
320 
360 
400 
440 
480 
520 



CTCGGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC 
CCCTGACGAG CAT C ACAAAA ATCGACGCTC AAGTCAGAGG 
TGGCGAAACC CGACAGGACT ATAAAGATAC CAGGCGTTTC 
CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG TTCCGACCCT 
GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 
AGCGTGGCGC TTTCTCAATG CTCACGCTGT AGGTATCTCA 
GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA 
CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 
AACTATCGTC TTGAGTCCAA CCCGGTAAGA CACGACTTAT 
CGCCACTGGC AGCAGCCACT GGTAACAGGA TTAGCAGAGC 
GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 
CCTAACTACG GCTACACTAG AAGGACAGTA TTTGGTATCT 
GCGCTCTGCT GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG 
TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT 560 
GGTTTTTTTG TTTGCAAGCA GCAGATTACG CGCAGAAAAA j>oo 
AAGGATCT 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 27: 



GGTACCTGCC ACCATGGCGC GGATTCTTTA TCACTGATAA 40 
GTTGGTGGAC ATATTATGTT TATCAGTGAT AAAGTGT C AA 8 0 

GCATGACAAA GTTGCAGCCG AATACAGTGA TCCGTGCCGG 12 0 

CCCTGGACTG TTGAACGAGG TCGGCGTAGA CGGTCTGACG 160 

ACACGCAAAC TGGCGGAACG GTTGGGGGTG CAGCAGCCGG 200 

CGCTTTACTG GCACTTCAGG AACAAGCGGG CGCCTTAAGG 240 

GCCATATGGT GAGTGGATGC CTTGACCCCA GGCGGGGATG 280 

GGGGAGACCT GTAGTCAGAG CCCCCGGGCA GCACAGGCCA 320 

ATGCCCGTCC TTCCCCTGCA GGATGAGTAG TGAGTGCCTC 3 60 

TCCTGGCCCT GGAAGTTGCC ACTCCAGTGC CCACCAGCCT 400 

TGTCCTAATA AAATTAAGTT GCATCATTTT GTCTGACTAG 440 

GTGTCCTCTA TAATATTATA AGCTTGATAT CGAATTCTTT 480 

CGGACTTTTG AAAGTGATGG TGGTGGGGGA AGGATTCGAA 520 

CCTTCGAAGT CGATGACGGC AGATTTAGAG TCTGCTCCCT 560 

TTGGCCGCTC GGGAACCCCA CCACGGGTAA TGCTTTTACT 600 



GGCCTGCTCC CTTATCGGGA AGCGGGGCGC ATCATATCAA 64 0 

ATGACGCGCC GCTGTAAAGT GTTACGTTGA GAAAGAATTC 680 
CTGCAGCCCG CCGCGTTGCT GGCGTTTTTC CATAGGCTCC 720 
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GCCCCCCTGA CGAGCATCAC AAAAATCGAC 
GAGGTGGCGA AAC CCGAC AG GACTATAAAG 
TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT 
CCCTGCCGCT T AC CGG AT AC CTGTCCGCCT 
GGGAAGCGTG GCGCTTTCTC AATGCTCACG 
CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG 
TGCACGAACC CCCCGTTCAG CCCGACCGCT 
CGGTAACTAT CGTCTTGAGT CCAACCCGGT 
TTATCGCCAC TGGCAGCAGC CACTGGTAAC 
GAGCGAGGTA TGTAGGCGGT GCTACAGAGT 
GTGGCCTAAC TACGGCTACA CTAGAAGGAC 
ATCTGCGCTC TGCTGAAGCC AGTTACCTTC 
TTGGTAGCTC TTGATCCGGC AAACAAACCA 
CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT 
AAAAAAGGAT CTGGGGGATC CGGAGAGCTC 
AGAGAGCAGT GAGGGAGAGA CAGAGACTCG 
GCTATTTCAG TTTTCTTTTC CGTTTTGTGC 
ATGAT AC CGG CCAATGCTTG GTTGCTATTT 
CCTTAGGGGA TGCCCCTCAA CTGGCCCTAT 
CCTGAGCTGC AGAGGATTCC TGCAGAGGAT 
CGTGGACCTC GCACAGCCTC TCCCACA 




GCTCAAGTCA 76 0 

AT AC CAGG CG 8 00 

CCTGTTCCGA 84 0 

TTCTCCCTTC 880 

CTGTAGGTAT 92 0 

CTGGGCTGTG 96 0 

GCGCCTTATC 10 0 0 

AAGACACGAC 104 0 

AGGATTAGCA 1080 

TCTTGAAGTG 112 0 

AGTATTTGGT 116 0 

GGAAAAAGAG 12 0 0 

CCGCTGGTAG 124 0 

TACGCGCAGA 128 0 

ACTCTAGATG 13 2 0 

AATTTCCGGA 13 6 0 

AATTTCACTT 14 0 0 

TGGAAACTCC 144 0 

AAAGGGC C AG 14 8 0 

CAAGACAGCA 152 0 

1547 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 28: 



GGTACCTGCC ACCATGGCGC GGATTCTTTA 
GTTGGTGGAC ATATTATGTT TAT C AGTG AT 
GCATGACAAA GTTGCAGCCG AATACAGTGA 
CCCTGGACTG TTGAACGAGG TCGGCGTAGA 
ACACGCAAAC TGGCGGAACG GTTGGGGGTG 
CGCTTTACTG GCACTTCAGG AACAAGCGGG 
GCCATATGGT GAGTGGATGC CTTGACCCCA 
GGGGAGACCT GTAGTCAGAG CCCCCGGGCA 
ATGCCCGTCC TTCCCCTGCA GGATGAGTAG 
TCCTGGCCCT GGAAGTTGCC ACTCCAGTGC 
TGTCCTAATA AAATTAAGTT GCATCATTTT 
GTGTCCTCTA TAATATTATA AGCTTGATAT 
CGGACTTTTG AAAGTGATGG TGGTGGGGGA 
CCTTCGAAGT CGATGACGGC AGATTTAGAG 
TTGGCCGCTC GGGAACCCCA CCACGGGTAA 
GGCCTGCTCC CTTATCGGGA AGCGGGGCGC 
ATGACGCGCC GCTGTAAAGT GTTACGTTGA 



TCACTGATAA 4 0 

AAAGTGTCAA 80 

TCCGTGCCGG 12 0 

CGGTCTGACG 16 0 

CAGCAGCCGG 2 00 

CGCCTTAAGG 24 0 

GGCGGGGATG 280 

GCACAGGCCA 32 0 

TGAGTGCCTC 3 60 

CCACCAGCCT 400 

GTCTGACTAG 440 

CGAATTCTTT 480 

AGGATTCGAA 52 0 

TCTGCTCCCT 560 

TGCTTTTACT 60 0 

AT CAT ATC AA 64 0 

GAAAGAATTC 6 80 
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CTGCAGCCCG CCGCGTTGCT GGCGTTTTTC CATAGGCTCC 720 

GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 760 

GAGGTGGCGA AACCCGACAG GACTATAAAG ATACCAGGCG 800 

TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA 840 

CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC 880 

GGGAAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGGTAT 920 

CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG 960 

TGCACGAACC CCCCGTTCAG CCQ£5ACCGCT GCGCCTTATC 1000 

CGGTAACTAT CGTCTTGAGT CCAACCCGGT AAGACACGAC 1040 

TTATCGCCAC TGGCAGCAGC CACTGGTAAC AGGATTAGCA 1080 

GAGCGAGGTA TGTAGGCGGT GCTACAGAGT TCTTGAAGTG 1120 

GTGGCCTAAC TACGGCTACA CTAGAAGGAC AGTATTTGGT 116 0 

ATCTGCGCTC TGCTGAAGCC AGTTACCTTC GGAAAAAGAG 1200 

TTGGTAGCTC TTGATCCGGC AAACAAACCA CCGCTGGTAG 1240 

CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT TACGCGCAGA 1280 

AAAAAAGGAT CTGGGGGATC CGGAGAGCTC CCAACGCGTT 1320 

P GGATGCATGG ATGAGGGAAA GGAGGTAAGA TCTGTAATGA 1360 

£ ATAAGCAGGA ACTTTGAAGA CTCAGTGACT CAGTGAGTAA 1400 

m TAAAGACTCA GTGACTTCTG ATCCTGTCCT AACTGCCACT 1440 

£ CCTTGTTGTC C C AAGAAAGC GGCTTCCTGC TCTCTGAGGA 1480 

= .= GGACCCCTTC CCTGGAAGGT AAAACTAAGG ATGTCAGCAG 1520 

Lr; AGAAATTTTT CCACCATTGG TGCTTGGTCA AAGAGGAAAC 1560 

IS TGATGAGCTC ACTCTAGATG AGAGAGCAGT GAGGGAGAGA 1600 

" CAGAGACTCG AATTTCCGGA GCTATTTCAG TTTTCTTTTC 1640 

rU CGTTTTGTGC AATTTCACTT ATGATACCGG CCAATGCTTG 1680 

' GTTGCTATTT TGGAAACTCC CCTTAGGGGA TGCCCCTCAA 1720 

i-* CTGGCCCTAT AAAGGGCCAG CCTGAGCTGC AGAGGATTCC 176 0 

}•& TGCAGAGGAT CAAGACAGCA CGTGGACCTC GCACAGCCTC 1800 
•Q TCCCACA 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
( i i i ) HYPOTHETICAL : No 
(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:29: 



1807 



GGTACCTGCC AC C ATGGCGA AGGGCGAGGA ACTGTTCACT 
GGCGTGGTCC CAATTCTCGT GGAACTGGAT GGCGATGTGA 
ATGGGCACAA ATTTTCTGTC AGCGGAGAGG GTGAAGGTGA 
TGCCACATAC GGAAAGCTCA CCCTGAAATT CATCTGCACC 
ACTGGAAAGC TCCCTGTGCC ATGGCCAACA CTGGT C ACT A 
CCTTCACCTA TGGCGTGCAG TGCTTTTCCA GATACCCAGA 
CCATATGAAG CAGCATGACT TTTTCAAGAG CGCCATGCCC 
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GAGGGCTATG 


m/"T TV TA TV 

TGCAGGAGAG 


AACCATCTTT 


TTCAAAGATG 


320 




"TV r~*f~~* TV 7\ /~1IT^ TV 

ACGGGAACTA 


CAAGACCCGC 


GCTGAAGTCA 


AGTTCGAAGG 


360 




1 GACACCCTG 


/""*rn/"*» TV TV m TV TV TV 

GTGAATAGAA 


TCGAGTTGAA 


GGGCATTGAC 


400 




1 1 IAAGGAAG 


TV nri/""*/""* TV TV TV /i TV m 

ATGGAAACAT 


TCTCGGCCAC 


AAG CTGGAAT 


440 




TV /"""I TV TV i^*"Xt TV m TV TV 

ACAACTATAA 


CTCCCACAAT 


GTGTACATCA 


TGGC CGACAA 


480 




/~1 /""I 7V 7V 7V 7\ /"* TV TV m 

GCAAAAGAAT 


GGCAT CAAGG 


TCAACTTCAA 


GATCAGACAC 


520 




TV tv f \ tv mm/™* tv rr/T 

AACATTGAGG 


ATGGAT CCGT 


GCAGCTGGCC 


GACCATTATC 


560 




TV TV /""I 7\ /"I 7V TV /T. TV /""I 

AACAGAACAC 


TCCAATCGGC 


GACGGCCCTG 


TGCTCCTCCC 


600 




TV /""i TV /"*» TV TV /""I TV rr» 

AGACAACCAT 


TACCTGTCCA 


CCCAGTCTGC 


CCTGTCTAAA 


640 




GATCCCAACG 


AAAAGAGAGA 


CCACATGGTC 


CTGCTGGAGT 


680 




TTGTG AC CG C 


TGCTGGGATC 


ACACATGGCA 


TGGACGAGCT 


720 




/~»rri TV y*t TV in yim/~i tv 

GTACAAGTGA 


GCGCCTTAAG 


GGCCATATGG 


TGAGTGGATG 


760 




CCTTGACCCC 


AGGCGGGGAT 


GGGGGAGACC 


TGTAGT C AGA 


800 




GCCCCCGGGC 


AGCACAGGCC 


AATGCCCGTC 


CTTCCCCTGC 


840 




AGGATGAGTA 


GTGAGTGCCT 


CTCCTGGCCC 


TGGAAGTTGC 


880 




CACT C C AGTG 


CCCACCAGCC 


TTGTCCTAAT 


AAAATTAAGT 


920 


TGCATCATTT 


TGTCTGACTA 


GGTGTCCTCT 


ATAATATTAT 


960 




AAGCTTGATA 


TCGAATTCTT 


TCGGACTTTT 


GAAAGTGATG 


1000 


ri : 


GTGGTGGGGG 


AAGGATTCGA 


ACCTTCGAAG 


TCGATGACGG 


1040 




CAGATTTAGA 


GTCTGCTCCC 


TTTGGCCGCT 


CGGGAACCCC 


1080 


ft 1 


ACCACGGGTA 


ATGCTTTTAC 


TGGCCTGCTC 


CCTTATCGGG 


1120 




AAGCGGGG CG 


CAT C AT ATC A 


AATGACGCGC 


CGCTGTAAAG 


1160 


! L| 


TGTTACGTTG 


AGAAAGAATT 


CCTGCAGCCC 


GCCGCGTTGC 


1200 




TGGCGTTTTT 


CCATAGGCTC 


CGCCCCCCTG 


ACGAGCATCA 


1240 


E L-f 


CAAAAAT CGA 


CGCTCAAGTC 


AGAGGTGGCG 


AAACCCGACA 


1280 


a 


GGACTATAAA 


GATACCAGGC 


GTTTCCCCCT 


GGAAGCTCCC 


1320 




TCGTGCGCTC 


TCCTGTTCCG 


ACCCTGCCGC 


TTACCGGATA 


1360 




CCTGTCCGCC 


TTTCTCCCTT 


CGGGAAG CGT 


GGCGCTTTCT 


1400 




CAATGCTCAC 


GCTGTAGGTA 


TCTCAGTTCG 


GTGTAGGTCG 


1440 




TTCGCTCCAA 


GCTGGGCTGT 


GTGCACGAAC 


CCCCCGTTCA 


1480 




GCCCGACCGC 


TGCGCCTTAT 


CCGGTAACTA 


TCGTCTTGAG 


1520 




TCCAACCCGG 


TAAGACACGA 


CTTATCGCCA 


CTGGCAGCAG 


1560 




CCACTGGTAA 


CAGGATTAGC 


AGAGCGAGGT 


ATGTAGGCGG 


1600 




TGCTACAGAG 


TTCTTGAAGT 


GGTGGCCTAA 


CTACGGCTAC 


1640 




TV y™t^n TV /"""I tv T\ yD tv 

ACTAGAAGGA 


CAGTATTTGG 


TATCTGCGCT 


CTGCTGAAGC 


1680 




C AGTT AC CTT 


CGGAAAAAGA 


GTTGGTAGCT 


CTTGATCCGG 


1720 




/"l TV TV TV /"■* "A TV TV /""I/"! 

C AAACAAAC C 


ACCGCTGGTA 


GCGGTGGTTT 


TTTTGTTTGC 


1760 




7V TV t~\ 7\ r** TV TV 

AAGCAGCAGA 


mm TV nri /— <i r^l y~« tv /<i 

TTACGCGCAG 


AAAAAAAGGA 


TCTGGGGGAT 


1800 




CCGGAGAGCT 


CCCAACGCGT 


TGGATGCATG 


GATGAGGGAA 


1840 




Ti/^T'TV r-rt TV TV /"l 

AGGAGGTAAG 


Tv m/^im/^imT* tv m 

ATCTGTAATG 


AATAAGCAGG 


AACTTTGAAG 


1880 




TV /^•rri/™* TV mri TV /™t 

ACT C AGTGAC 


m/i tv y™i m /~i tv /~« m tv 

T CAGTGAGTA 


ATAAAGACT C 


AGTGACTTCT 


1920 










(~*/~*f~1 TV TV TV TV TV /'"I 

C C C AAG AAAG 


1960 




CGGCTTCCTG 


CTCTCTGAGG 


AGGACCCCTT 


CCCTGGAAGG 


2000 




TAAAACTAAG 


GATGTCAGCA 


GAGAAATTTT 


TCCACCATTG 


2040 




GTGCTTGGTC 


AAAGAGGAAA 


CTGATGAGCT 


CACTCTAGAT 


2080 




GAGAGAGCAG 


TGAGGGAGAG 


ACAGAGACTC 


GAATTTCCGG 


2120 




AGCTATTTCA 


GTTTTCTTTT 


CCGTTTTGTG 


CAATTTCACT 


2160 




TATGATACCG 


GCCAATGCTT 


GGTTGCTATT 


TTGGAAACTC 


2200 




CCCTTAGGGG 


ATGCCCCTCA 


ACTGGCCCTA 


TAAAGGGCCA 


2240 




GCCTGAGCTG 


CAGAGGATTC 


CTG C AGAGGA 


TCAAGACAGC 


2280 




ACGTGGACCT 


CGCACAGCCT 


CTCCCACA 




2308 
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